SQuaScheD - Unsupervised Schema Discovery for Heterogeneous Data
This website provides supplementary material for the article "Here is the Data. Where is its Schema?", submitted to the 24th International WWW Conference 2015 as submission #312.
Below you will find the detailed hierarchies, the ground-truth class distributions, and the evolution of the MDL (minimum description length) for the hierarchies discovered by SQuaScheD on all datasets mentioned in the paper.
Detailed SQuaScheD Discovered Hierarchies
The interactive visualizations below show the most representative attributes and entities for each class of the hierarchy discovered by SQuaScheD on each dataset:
Ground Truth Class Distribution
Distribution of the bottom-most ground-truth class in the discovered class hierarchy for all datasets.
ActivityEducationalInstitution
Ground Truth
SQuaScheD
ArchitecturalStructure
Ground Truth
SQuaScheD
Event
Ground Truth
SQuaScheD
Event_NaturalPlace_WrittenWork
Ground Truth
SQuaScheD
Infrastructure
Ground Truth
SQuaScheD
RouteOfTransportation
Ground Truth
SQuaScheD
Species
Ground Truth
SQuaScheD
Tunnel
Ground Truth
SQuaScheD
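As a rough illustration of what such a distribution contains, the sketch below tabulates, for each discovered class, the share of entities carrying each ground-truth label. It is not the authors' code: the function name `ground_truth_distribution`, the input format (pairs of discovered class and bottom-most ground-truth class), and the example labels are all assumptions made for illustration.

```python
from collections import Counter, defaultdict


def ground_truth_distribution(assignments):
    """Return, per discovered class, the fraction of entities per ground-truth class.

    `assignments` is a hypothetical input format: an iterable of
    (discovered_class, ground_truth_class) pairs, one pair per entity,
    where ground_truth_class is the entity's bottom-most ground-truth label.
    """
    counts = defaultdict(Counter)
    for discovered, truth in assignments:
        counts[discovered][truth] += 1

    distribution = {}
    for discovered, counter in counts.items():
        total = sum(counter.values())
        # Normalise raw counts to fractions that sum to 1 per discovered class.
        distribution[discovered] = {t: n / total for t, n in counter.items()}
    return distribution


# Illustrative data only; the class names are made up.
example = [("c1", "Tunnel"), ("c1", "Tunnel"), ("c1", "Bridge"), ("c2", "Bridge")]
result = ground_truth_distribution(example)
```

A discovered class whose distribution is concentrated on a single ground-truth label corresponds closely to that label, while a flat distribution indicates a class that mixes several ground-truth classes.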
MDL Evolution in SQuaScheD
The figures below show how the MDL and the class precision, class recall, and class F2 evolve over the steps of the SQuaScheD process for each dataset.
ActivityEducationalInstitution
ArchitecturalStructure
Event_NaturalPlace_WrittenWork
Event
Infrastructure
RouteOfTransportation
Species
Tunnel
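For readers unfamiliar with the F2 measure, the sketch below assumes it denotes the standard F-beta score with beta = 2, which weights recall more heavily than precision. This page does not define the measure, so that reading, and the helper name `f_beta`, are assumptions rather than the authors' definition.

```python
def f_beta(precision: float, recall: float, beta: float = 2.0) -> float:
    """Standard F-beta score: (1 + beta^2) * P * R / (beta^2 * P + R).

    With beta = 2 (assumed here to be the "F2" of the figures), recall
    counts more than precision. Returns 0.0 when the denominator is zero.
    """
    beta_sq = beta * beta
    denominator = beta_sq * precision + recall
    if denominator == 0:
        return 0.0
    return (1 + beta_sq) * precision * recall / denominator


# Illustrative values only, not results from the paper.
score = f_beta(precision=0.5, recall=1.0)
```

Under this definition a class with perfect recall but low precision still scores fairly high, which is worth keeping in mind when reading the F2 curves alongside the precision and recall curves.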